104 results found.
Language Type:
Multilingual
Languages:
Hindi
Availability:
<Not Specified>
License:
Open Source
Size:
327087 words Production Status:
Existing-used
Use:
Multiple NLP tasks
-
Paper title:Improvised and Adaptable Statistical Morph Analyzer (SMA++)
-
Paper track:Short Paper
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Saikrishna Srirampur | IIIT Hyderabad | IN |
| Author 2 | Deepak Kumar Malladi | IIIT Hyderabad | IN |
| Author 3 | Radhika Mamidi | IIIT-Hyderabad, Professor | None |
| Main Contact | Saikrishna Srirampur | IIIT Hyderabad | None |
Documentation:
http://www.aclweb.org/anthology/W12-56Language Type:
Multilingual
Languages:
Hindi
Availability:
Freely Available
License:
CreativeCommons ShareAlike
Size:
605 MByte Production Status:
Newly created-finished
Use:
Discourse
-
Paper title:Developing Politeness Annotated Corpus of Hindi Blogs
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Ritesh Kumar | Dept. of Linguistics, Dr. Bhim Rao Ambedkar University, Agra | IN | Centre for Linguistics, Jawaharlal Nehru University | IN |
| Main Contact | Ritesh Kumar | Dept. of Linguistics, Dr. Bhim Rao Ambedkar University, Agra | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English Hindi
Availability:
Freely Available
License:
<Not Specified>
Size:
5.79 MByte Production Status:
Newly created-finished
Use:
Question Answering
-
Paper title:MMQA: A Multi-domain Multi-lingual Question-Answering Framework for English and Hindi
-
Paper track:Evaluation
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Deepak Gupta | IIT Patna | IN |
| Author 2 | Surabhi Kumari | Indian Institute of Technology Patna | IN |
| Author 3 | Asif Ekbal | Indian Institute of Technology Patna | IN |
| Author 4 | Pushpak Bhattacharyya | CSE Department, IIT Bombay | IN |
| Main Contact | Deepak Gupta | IIT Patna | None |
Documentation:
The documentation has been provided with the corpora in English.
Written
Lexicon,
Language Type:
Multilingual
Languages:
Hindi
Availability:
Freely Available
License:
CreativeCommons: Attribution-NonCommercial-ShareAlike CC BY-NC-SA
Size:
4 MByte Production Status:
Newly created-in progress
Use:
Word Sense Disambiguation
-
Paper title:Synset Ranking of Hindi WordNet
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Sudha Bhingardive | Department of CSE, IIT Bombay | IN |
| Author 2 | Rajita Shukla | CFILT, IIT Bombay. | IN |
| Author 3 | Jaya Saraswati | CFILT, IIT Bombay. | IN |
| Author 4 | Laxmi Kashyap | CFILT, IIT Bombay. | IN |
| Author 5 | Dhirendra Singh | <Not Specified> | None |
| Author 6 | Pushpak Bhattacharya | Dept. of CSE, IIT Bombay | IN |
| Main Contact | Sudha Bhingardive | Department of CSE, IIT Bombay | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Hindi
Availability:
Not Applicable
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Newly created-in progress
Use:
Discourse
-
Paper title:Challenges in the development of annotated corpora of computer-mediated communication in Indian Languages: A Case of Hindi
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Ritesh Kumar | <Not Specified> | None | Jawaharla Nehru University, New Delhi | None |
| Main Contact | Ritesh Kumar | Jawaharlal Nehru University | IN | Jawaharla Nehru University, New Delhi | IN |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Hindi
Availability:
From Owner
License:
<Not Specified>
Size:
84K words Production Status:
Newly created-in progress
Use:
Semantic role labeling, Verb sense disambiguation
-
Paper title:Empty Argument Insertion in the Hindi PropBank
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Ashwini Vaidya | University of Colorado at Boulder | None |
| Author 2 | Jinho D. Choi | University of Colorado at Boulder | None |
| Author 3 | Martha Palmer | University of Colorado at Boulder | None |
| Author 4 | Bhuvana Narasimhan | University of Colorado at Boulder | None |
| Main Contact | Jinho Choi | University of Colorado at Boulder | US |
Documentation:
'''Analysis of the Hindi Proposition Bank using Dependency Structure'', Ashwini Vaidya, Jinho D. Choi, Martha Palmer, Bhuvana Narasimhan, Proceedings of ACL workshop on Linguistic Annotation (LAW'11), 21-29, Portland, Oregon, 2011'Language Type:
Multilingual
Languages:
Hindi
Availability:
Not Applicable
License:
<Not Specified>
Size:
210 sentences Production Status:
Newly created-finished
Use:
Evaluation/Validation
-
Paper title:Sentence Level Temporality Detection using an Implicit Time-sensed Resource
-
Paper track:Evaluation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Sabyasachi Kamila | Research Scholar IIT Patna | IN |
| Author 2 | Asif Ekbal | Indian Institute of Technology Patna | IN |
| Author 3 | Pushpak Bhattacharyya | CSE Department, IIT Bombay | IN |
| Main Contact | Sabyasachi Kamila | Research Scholar IIT Patna | None |
Documentation:
<Not Specified>
Speech
Corpus,
Language Type:
Monolingual
Languages:
Hindi
Availability:
Not Available
License:
Size:
350 hours Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Improving Low Resource Code-switched ASR using Augmented Code-switched TTS
-
Paper track:8.12 Cross-lingual and multilingual/accent aspects/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yash Sharma | Hindi Monolingual (Microsoft Proprietary) | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Multilingual
Languages:
Bengali Gujarati Hindi Kannada Malayalam Odia Rajasthani Tamil Telugu
Availability:
License:
Creative Commons
Size:
None Production Status:
Existing-used
Use:
Speech Synthesis
-
Paper title:Generic Indic Text-to-speech Synthesisers with Rapid Adaptation in an End-to-end Framework
-
Paper track:7.14 Cross-lingual and multilingual aspects in spe/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Anusha Prakash | indic TTS | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Multilingual
Languages:
Dari/Pashto Dutch English Finnish French Hindi Icelandic Indonesian Japanese Lithuanian Malay Mandarin Nepali Portuguese Punjabi Romanian Slovenian Spanish
Availability:
From Owner
License:
CreativeCommons
Size:
467 hours Production Status:
Newly created-finished
Use:
Person Identification
-
Paper title:JukeBox: A Multilingual Singer Recognition Dataset
-
Paper track:4.3 Speaker verification and identification/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Anurag Chowdhury | JukeBox | /N |
Documentation:
Documentation in English language will be made available upon publication of the dataset.




